# Multimodal Image Generation
Nexus Gen
Apache-2.0
Nexus-Gen is a unified model that combines the linguistic reasoning capabilities of large language models with the image generation capabilities of diffusion models
Text-to-Image
Transformers

N
modelscope
129
9
Omnigen V1
MIT
OmniGen is a unified image generation model supporting multimodal prompts, designed with simplicity, flexibility, and ease of use in mind.
Text-to-Image
O
BAAI
121
9
Omnigen V1 Fp8 E4m3fn
MIT
OmniGen is a unified multimodal image generation model capable of producing various types of images based on diverse instructions, without requiring additional plugins or preprocessing steps.
Text-to-Image
O
silveroxides
64
2
Lumina Mgpt 7B 512
Lumina-mGPT is a family of multimodal autoregressive models excelling in various vision and language tasks, particularly in generating flexible and realistic images from text descriptions.
Text-to-Image
L
Alpha-VLLM
1,185
4
Featured Recommended AI Models